A multi-dimensional indexing approach for timestamped event sequence matching

نویسندگان

  • Sanghyun Park
  • Jung-Im Won
  • Jeehee Yoon
  • Sang-Wook Kim
چکیده

This paper addresses the problem of timestamped event sequence matching, a new type of similar sequence matching that retrieves the occurrences of interesting patterns from timestamped sequence databases. The sequential-scan-based method, the trie-based method, and the method based on the iso-depth index are well-known approaches to this problem. In this paper, we point out their shortcomings, and propose a new method that effectively overcomes these shortcomings. The proposed method employs an R*-tree, a widely accepted multi-dimensional index structure that efficiently supports timestamped event sequence matching. To build the R*-tree, this method extracts time windows from every item in a timestamped event sequence and represents them as rectangles in n-dimensional space by considering the first and last occurring times of each event type. Here, n is the total number of disparate event types that may occur in a target application. To resolve the dimensionality curse in the case when n is large, we suggest an algorithm for reducing the dimensionality by grouping the event types. Our sequence matching method based on the R*-tree performs with two steps. First, it efficiently identifies a small number of candidates by searching the R*-tree. Second, it picks out true answers from the set of candidates. We prove its robustness formally, and also show its effectiveness via extensive experiments. 2007 Elsevier Inc. All rights reserved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Index-Based Method for Timestamped Event Sequence Matching

This paper addresses the problem of timestamped event sequence matching, a new type of sequence matching that retrieves the occurrences of interesting patterns from a timestamped event sequence. Timestamped event sequence matching is useful for discovering temporal causal relationships among timestamped events. In this paper, we first point out the shortcomings of prior approaches to this probl...

متن کامل

یک روش مبتنی بر خوشه‌بندی سلسله‌مراتبی تقسیم‌کننده جهت شاخص‌گذاری اطلاعات تصویری

It is conventional to use multi-dimensional indexing structures to accelerate search operations in content-based image retrieval systems. Many efforts have been done in order to develop multi-dimensional indexing structures so far. In most practical applications of image retrieval, high-dimensional feature vectors are required, but current multi-dimensional indexing structures lose their effici...

متن کامل

Video indexing based on object motion for video-surveillance context

We have developed an algorithm for automatic indexing of video surveillance sequence. The indexing consists in the recognition of specific scenarios defined by a human operator. The principal utility concerns the off-line retrieval of specific behaviour from a long stored image sequence. From a video indexing point of view, image acquisition is made in a favourable condition: fixed acquisition ...

متن کامل

Segregating event streams and noise with a Markov renewal process model

We describe an inference task in which a set of timestamped event observations must be clustered into an unknown number of temporal sequences with independent and varying rates of observations. Various existing approaches to multi-object tracking assume a fixed number of sources and/or a fixed observation rate; we develop an approach to inferring structure in timestamped data produced by a mixt...

متن کامل

Video Data Indexing by 2D C-Trees

Video data contains a large amount of spatial and temporal information. The changes of video frames are quite useful for motion analysis and cannot be provided by other media easily. Chang et al. had proposed an effective 2D string approach for spatial indexing of image data. In this paper, we extend this iconic approach and apply it to video data indexing. We propose 2D C-trees to represent th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Inf. Sci.

دوره 177  شماره 

صفحات  -

تاریخ انتشار 2007